CombinGym: a benchmark platform for machine learning-assisted design of combinatorial protein variants
This paper introduces CombinGym, a benchmark platform featuring 14 curated datasets and a comprehensive evaluation of machine learning algorithms to address the gap in combinatorial protein design, demonstrating that leveraging lower-order mutation data significantly improves the prediction and experimental engineering of higher-order protein variants.